Abstract

In this paper we present an alternative separable implementation of $L^2$-orthogonal space-time codes (STC) for
continuous phase modulation (CPM). In this approach, we split the STC CPM transmitter into a single conventional
CPM modulator and a correction filter bank. While the CPM modulator is common to all transmit antennas, the
correction filter bank applies a different correction unit to each antenna. Thereby, desirable code properties such as
orthogonality and full diversity are achievable with only a slightly larger bandwidth demand. This new representation
has three main advantages. First, it allows the orthogonality condition to be generalized easily to an arbitrary number
of transmit antennas. Second, for a quite general set of correction functions that we detail, it can be proved that full
diversity is achieved. Third, by separating the demodulation and inverse correction steps inside the receiver, a simpler
receiver can be designed as a bank of data-independent inverse correction filters followed by a single CPM demodulator.
Therefore, in this implementation, only one correlation filter bank is necessary for the detection of all transmitted
signals. The decoding effort grows only linearly with the number of transmit antennas.



I. Introduction 

The combination of space-time coding (STC) with continuous phase modulation (CPM) systems has attracted
considerable interest. It brings the possibilities of capacity increase [1] and robustness to fading [2] to systems that
display good spectral and power efficiency [3]. Pioneered by Zhang and Fitz [4], the first STC CPM constructions
were based on trellis codes. This approach was also pursued by Zajić and Stüber in [5] for full response CPM,
further optimized in [6] and extended to partial response CPM in [7]. Bokolamulla and Aulin [8] and Maw and
Taylor [9] designed STC by splitting the CPM signal into a memoryless modulator and a continuous phase encoder
(CPE) [10]. While Bokolamulla and Aulin use codes from [11], Maw and Taylor combine an external encoder with the
STC and the CPE. However, for these codes the decoding effort grows exponentially with the number of transmit
antennas. This was partially circumvented by burst-wise orthogonality as introduced by Silvester et al. in [12] and
by block-wise orthogonality as established by Wang and Xia in [13], [14]. Unfortunately, this latter design is based
on the Alamouti code [15] and is thus restricted to two transmit antennas. An extension to 4 transmit antennas
based on quasi-orthogonal space-time codes was presented in [16].



Mainly motivated by the low complexity of decoding as described in [13], [14], our present contribution concerns
orthogonal space-time block codes (STBC) for CPM systems. In our previous work [17]-[19], we have been able to
design $L^2$-orthogonal space-time codes for 2 and 3 transmit antennas which achieve full rate and full diversity with
low decoding effort. In [17] we generalized the two-antenna code proposed by Wang and Xia [14] from pointwise
to $L^2$-orthogonality. In [18] we presented the first $L^2$-orthogonal code family, coined Parallel Codes (PC), for CPM
with 3 antennas.

In the present paper, we briefly review some of our previous results and generalize them to an arbitrary number 
of transmit antennas. More specifically, for Parallel Codes we present an alternative approach to the encoding 
by splitting the STC CPM transmitter into a conventional CPM modulator and a correction filter bank. While the 
modulator is shared by all transmit antennas, the correction filter bank is specific to each transmit antenna. Therefore, 
the correction filter bank fully characterizes the properties of the code, e.g. orthogonality, diversity and coding gain. 
This simple framework makes it possible to readily design $L^2$-orthogonal Parallel Codes for an arbitrary number
of transmit antennas, and we prove that full diversity is achieved with these codes.

Again, by separating the demodulation and inverse correction steps at the receiver side, a simple receiver is 
designed as a data independent inverse correction filter bank followed by a single decorrelation unit. In this 
implementation, only one decorrelation unit for the detection of all transmitted CPM signals is necessary. The 
overall decoding effort grows only linearly with the number of transmit antennas.

The remainder of the paper is organized as follows. In Section II we present our new code representation,
show that full diversity is achieved and give a condition to obtain $L^2$-orthogonality for an arbitrary number of
transmit antennas. In Section III we introduce a fast decoding algorithm for Parallel Codes. In Section IV the
code performance and the decoding algorithm are evaluated by simulations and finally, in Section V, we conclude
this paper.



II. Generalized Code Representation

In this section we develop a simplified representation for $L^2$-orthogonal PC and prove that PC with linear phase
correction functions provide full diversity. Finally, we give a condition to obtain $L^2$-orthogonal codes.

A. System Model and Code Structure

Let us briefly introduce our model for the CPM transmitter with $L_t$ transmitting antennas. We adopt the block
structure from [18] and accordingly define the CPM signal for blocks of $L_t$ symbol intervals. The CPM
block of length $L_tT$ is given by [3]



for $lL_tT \le t \le (l+1)L_tT$. Here, $E_b$ is the block energy, $T$ is the symbol length, $\gamma$ the CPM memory length,
$h = m_0/p$ is the modulation index with $m_0$ and $p$ relatively prime, and $d_i$ is the data symbol taken from the set

$\Omega_d = \{-M+1, -M+3, \ldots, M-3, M-1\}$.

Fig. 1. Block diagram of the transmitter and receiver for STC CPM using the generalized code representation

For convenience, the data symbols of the current block $l$ are collected in the vector $\mathbf{d} = [d_{lL_t+1}, \ldots, d_{(l+1)L_t}]$.
The phase pulse $q(t)$ is a continuous function with $q(t) = 0$ for $t \le 0$ and $q(t) = 1/2$ for $t \ge \gamma T$, and the
accumulated phase

$\theta(l) = \frac{h}{2} \sum_{i} d_i$ (2)

sums all symbols whose phase pulse has reached 1/2 by the end of the previous block.
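The ingredients introduced in this subsection (phase pulse, accumulated phase, constant envelope) can be illustrated with a minimal sketch. It assumes a standard CPM form, $s(t) = \sqrt{E_b/(L_tT)}\,\exp(j2\pi h \sum_i d_i\, q(t-(i-1)T))$, with a linear (REC) pulse of length $\gamma T$; the summation convention, the function names and the example symbols are assumptions made for illustration and are not taken from [3].

```python
import cmath
import math

def q_rec(t, T, gamma):
    """Linear (REC) phase pulse: 0 for t <= 0, 1/2 for t >= gamma*T."""
    if t <= 0.0:
        return 0.0
    if t >= gamma * T:
        return 0.5
    return t / (2.0 * gamma * T)

def cpm_phase(t, symbols, h, T, gamma):
    """Phase in cycles: h * sum_i d_i q(t - (i-1)T), symbols stored 0-based."""
    return h * sum(d * q_rec(t - i * T, T, gamma) for i, d in enumerate(symbols))

def accumulated_phase(block, symbols, h, Lt, gamma):
    """h/2 times the sum of all symbols whose pulse is complete at the block start."""
    finished = max(0, block * Lt - gamma + 1)
    return 0.5 * h * sum(symbols[:finished])

def cpm_signal(t, symbols, h, T, gamma, Eb, Lt):
    """Constant-envelope sample sqrt(Eb/(Lt*T)) * exp(j*2*pi*phase)."""
    amplitude = math.sqrt(Eb / (Lt * T))
    return amplitude * cmath.exp(2j * math.pi * cpm_phase(t, symbols, h, T, gamma))

# Hypothetical example: two blocks of Lt = 3 symbols, M = 4, 2REC, h = 1/2.
symbols = [3, -1, 1, -3, 1, 3]
h, T, gamma, Eb, Lt = 0.5, 1.0, 2, 1.0, 3
samples = [cpm_signal(k * 0.05, symbols, h, T, gamma, Eb, Lt) for k in range(121)]
```

At the start of the second block ($t = 3T$) the total phase splits into the accumulated part returned by `accumulated_phase(1, ...)` and the contribution of the symbols whose pulses are still in progress.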

The family of $L^2$-orthogonal codes proposed in [18] allows $L_t$ CPM signals to be sent over the transmit antennas.
The signal sent by each antenna is further modified by an additional correction function. Here, we present a new,
generalized representation for Parallel Codes, a member of the $L^2$-orthogonal code family. These codes use the
same CPM signal $s(t,\mathbf{d})$ for each antenna, and only the correction function $c_m(t)$ differs for each antenna $m$.
Consequently, we rewrite the vector of the transmitted signals $\mathbf{s}(t,\mathbf{d})$ and obtain the new representation

$\mathbf{s}(t,\mathbf{d}) = s(t,\mathbf{d})\,\mathbf{c}(t) = s(t,\mathbf{d})\,[c_1(t), c_2(t), \ldots, c_{L_t}(t)]^T$. (3)

Figure 1 illustrates the single CPM modulator and the $L_t$ data-independent correction functions, one for each transmit
antenna. To maintain the constant amplitude of the CPM signal, the correction functions modify only the phase,
i.e.

$c_m(t) = \exp\big(j2\pi\phi_{c_m}(t)\big)$, (4)

where the design of $\phi_{c_m}(t)$ will be described in the following.



B. Diversity 

For convenience, we assume a receiver equipped with only one antenna, but the extension to multiple-antenna
receivers is straightforward. The channel between the $m$-th transmitting antenna and the receiving antenna is
characterized by the channel coefficient $h_m$. All channel coefficients are assumed to be mutually independent,
block-wise constant, Rayleigh distributed random variables. Furthermore, we assume perfect channel state information
(CSI) at the receiver and corruption by complex additive white Gaussian noise (AWGN) $n(t)$. Then the received signal
follows

$r(t,\mathbf{d}) = \mathbf{h}^T \mathbf{s}(t,\mathbf{d}) + n(t)$,

where $\mathbf{h} = [h_1, \ldots, h_{L_t}]^T$ collects the channel coefficients.
To characterize STC with linear modulations, a signal matrix $\mathbf{C}_s$ was introduced in [2]. This matrix results from
the correlation of all the possible differences of code words. To achieve full diversity, $\mathbf{C}_s$ ought to be full rank. It
was shown by Zhang and Fitz [20] that for nonlinear modulation, i.e. CPM here, the signal matrix should be
defined over waveforms, as

$\mathbf{C}_s = \int \boldsymbol{\Delta}(t)\,\boldsymbol{\Delta}^H(t)\,dt$, (7)

where $\boldsymbol{\Delta}(t)$ is the difference between two transmitted signals modulated by different data symbols $\mathbf{d}$ and $\tilde{\mathbf{d}}$,

$\boldsymbol{\Delta}(t) = [\Delta_1(t), \ldots, \Delta_{L_t}(t)]^T = \mathbf{s}(t,\mathbf{d}) - \mathbf{s}(t,\tilde{\mathbf{d}})$. (8)

Proposition 1 from [20] shows that $\mathbf{C}_s$ has full rank if and only if $\mathbf{u}^T\boldsymbol{\Delta}(t) \neq 0$ for all vectors
$\mathbf{u} \in \mathbb{C}^{L_t}$ except $\mathbf{u} = \mathbf{0}$. This means that the waveforms of the transmitted signals have to be
linearly independent. By applying Eq. (3) we obtain the diversity condition

$\mathbf{u}^T \big(s(t,\mathbf{d}) - s(t,\tilde{\mathbf{d}})\big)\,\mathbf{c}(t) \neq 0$. (9)

Now, since $s(t,\mathbf{d})$ and $s(t,\tilde{\mathbf{d}})$ differ in at least one symbol, their difference does not vanish identically
within a block. Thus, Eq. (9) simplifies to

$\mathbf{u}^T\mathbf{c}(t) \neq 0$, (10)

which only depends on the correction functions $c_m(t)$. A large class of functions fulfills Eq. (10). In the following,
we focus only on correction functions with linear phase. Thus we define parametrized phase functions as

$\phi_{c_m}(t) = \frac{m-1}{L_tT}\,\alpha t + \beta_m$, (11)

where $\beta_m$ is a constant phase offset and $\alpha$ is a nonzero slope. Now, $\mathbf{u}^T\mathbf{c}(t) = 0$ would imply that

$\sum_{m=1}^{L_t} u_m\, e^{j2\pi\beta_m}\, e^{j2\pi(m-1)\alpha t/(L_tT)} = 0$. (12)

Introducing the polynomial

$p(x) = \sum_{m=1}^{L_t} u_m\, e^{j2\pi\beta_m}\, x^{m-1}$, (13)

Eq. (12) would mean that $p(e^{j2\pi\alpha t/(L_tT)}) = 0$ for all $t$ in the block. This would imply that the polynomial $p(x)$,
of degree $L_t - 1$, vanishes at more than $L_t$ distinct points. Thus, $p = 0$ and $u_m = 0$ for all $m$. Consequently, by
[20, Prop. 1], the signal matrix has full rank and all the codes achieve full diversity.
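The linear-independence argument can be checked numerically with a small sketch. Sampling the linear-phase correction functions at $L_t$ instants gives a Vandermonde-type matrix; if its determinant is nonzero, $\mathbf{u}^T\mathbf{c}(t) = 0$ at those instants already forces $\mathbf{u} = \mathbf{0}$. The sampling instants $t_k = kT$, the helper names and the example offsets are choices made here for illustration.

```python
import cmath
import math

def correction(m, t, Lt, T, alpha=1.0, beta=0.0):
    """c_m(t) = exp(j*2*pi*((m-1)*alpha*t/(Lt*T) + beta)) for antenna m = 1..Lt."""
    return cmath.exp(2j * math.pi * ((m - 1) * alpha * t / (Lt * T) + beta))

def determinant(rows):
    """Determinant of a small complex matrix via Gaussian elimination with pivoting."""
    a = [list(r) for r in rows]
    n = len(a)
    det = 1.0 + 0.0j
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(a[r][col]))
        if abs(a[pivot][col]) < 1e-12:
            return 0.0j  # singular: the sampled waveforms are linearly dependent
        if pivot != col:
            a[col], a[pivot] = a[pivot], a[col]
            det = -det
        det *= a[col][col]
        for r in range(col + 1, n):
            factor = a[r][col] / a[col][col]
            for c in range(col, n):
                a[r][c] -= factor * a[col][c]
    return det

def sample_matrix(Lt, T=1.0, alpha=1.0, betas=None):
    """Rows: sampling instants t_k = k*T; columns: antennas m = 1..Lt."""
    betas = betas or [0.0] * Lt
    return [[correction(m, k * T, Lt, T, alpha, betas[m - 1])
             for m in range(1, Lt + 1)] for k in range(Lt)]

det_alpha_one = determinant(sample_matrix(3, alpha=1.0))
det_with_offsets = determinant(sample_matrix(3, alpha=1.0, betas=[0.1, 0.2, 0.3]))
det_alpha_zero = determinant(sample_matrix(3, alpha=0.0))
```

With $\alpha = 1$ the samples are the $L_t$-th roots of unity and the determinant is nonzero regardless of the phase offsets, whereas the excluded case $\alpha = 0$ makes all correction functions identical up to a constant phase and the matrix singular.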

The linear phase correction functions are similar to the idea of tilting phase as proposed by Rimoldi [10]. However, 
the purpose of tilted phase in [10] was to simplify the states of single input single output CPM systems. Here, the 
phase drifts are introduced to achieve $L^2$-orthogonality between transmit antennas. Therefore the tilt angle (i.e. the
slope of the linear phase function or the phase shift) has a quite different role in the two approaches. 



C. Orthogonality 

Fig. 2. Merging of inter- and inner-block trellis for simplified detection with $l = 1$ (pps: paths per state)

With the new representation of CPM introduced in Section II-A, we derive the orthogonality condition for an
arbitrary number of transmit antennas. $L^2$-orthogonality is imposed by [18]

$\int_{lL_tT}^{(l+1)L_tT} \mathbf{s}(t,\mathbf{d})\,\mathbf{s}^H(t,\mathbf{d})\,dt = E_b\,\mathbf{I}$,

where $\mathbf{I}$ is the $L_t \times L_t$ identity matrix. Due to the constant amplitude of the CPM signal, orthogonality depends
only on the correction functions. By Def. (4), $c_m(t)\,c_m^*(t) = 1$. So, we only need to cancel all the cross-correlation
terms and get

$\int_{lL_tT}^{(l+1)L_tT} c_m(t)\,c_{m'}^*(t)\,dt = 0$

for $m \neq m'$. To fulfill this condition with the linear phase functions of Eq. (11), we have to integrate over full
rotations on the unit circle. Therefore, $\alpha$ needs to be an integer. In the following we set $\alpha = 1$ for two reasons:

1) Minimizing bandwidth: The correction function causes a frequency shift depending on the slope of the phase.
To minimize the overall bandwidth of the system, the frequency shift needs to be small. Hence, the phase
slope of the correction function is required to be minimal.

2) Equivalence to linPC [18]: If $\alpha = 1$, Parallel Codes with linear phase functions coincide with the linPC family
proposed in [18]. The phase offsets in Eq. (11) correspond to the initial phases of the linPC.
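The requirement of an integer slope can be illustrated with a short numerical sketch. It estimates the normalized cross-correlations of the linear-phase correction functions over one block by a Riemann sum, once for $\alpha = 1$ and once for a non-integer slope. The sample count, the choice $L_t = 4$ and the function names are assumptions made for illustration.

```python
import cmath
import math

def gram_matrix(Lt, alpha, n_samples=1200, T=1.0, betas=None):
    """Riemann-sum estimate of (1/(Lt*T)) * integral of c_m(t) * conj(c_m'(t)) over one block."""
    betas = betas or [0.0] * Lt
    block = Lt * T
    gram = [[0j] * Lt for _ in range(Lt)]
    for k in range(n_samples):
        t = k * block / n_samples
        # Index m = 0..Lt-1 plays the role of (m - 1) in the phase function.
        c = [cmath.exp(2j * math.pi * (m * alpha * t / block + betas[m]))
             for m in range(Lt)]
        for a in range(Lt):
            for b in range(Lt):
                gram[a][b] += c[a] * c[b].conjugate() / n_samples
    return gram

def max_cross_term(gram):
    """Largest off-diagonal magnitude, i.e. the worst residual cross-correlation."""
    n = len(gram)
    return max(abs(gram[a][b]) for a in range(n) for b in range(n) if a != b)

cross_integer = max_cross_term(gram_matrix(4, alpha=1.0))
cross_fractional = max_cross_term(gram_matrix(4, alpha=0.5))
```

For $\alpha = 1$ every off-diagonal term integrates over whole rotations and vanishes up to rounding, while for $\alpha = 0.5$ neighbouring antennas only complete half a rotation and a residual of about $2/\pi$ remains.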

III. Fast Decoding Algorithm 

In this section we provide a simplified decoding scheme for the proposed Parallel Codes. For convenience, we
assume only one receive antenna ($L_r = 1$), but the extension to multiple antennas is straightforward.

The received signal $r(t,\mathbf{d})$ is a superposition of the transmitted CPM signals, which are weighted by the channel
coefficients. Due to the CPM-inherent continuous phase encoder (CPE) [10], the received signal consists of $L_t$
superposed trellis codes. These are generally quite hard to decode.

To reduce the complexity of the decoder we first consider the block structure of the proposed STC. This facilitates
the splitting of the super trellis into an "inter-block" and an "inner-block" trellis as shown in Figure 2. To achieve full
rate, each block contains $L_t$ symbols with an alphabet size $M$, which are distributed over the ST-block. Thus each
state of the inter-block trellis has $M^{L_t}$ leaving paths, i.e. in order to calculate all block distances
$D_B(\mathbf{d},\tilde{\mathbf{d}}\,|\,\theta(k))$, $M^{L_t}$ matched filters of length $L_tT$ have to be applied $pM^{\gamma-1}$ times. This
exponential growth of complexity with the number of transmit antennas makes the application in real-world systems
impractical.

Eq. (18) shows the non-simplified maximum likelihood metric for the inter-block trellis,

$D_B(\mathbf{d},\tilde{\mathbf{d}}\,|\,\theta(k)) = \int_0^{L_tT} \Big| r(t,\mathbf{d}) - \sum_{m=1}^{L_t} h_m\, c_m(t)\, s(t,\tilde{\mathbf{d}}) \Big|^2 dt$. (18)

The absolute value contains all $L_t$ data symbols. Therefore we have to consider all cross-correlations between the
data symbols, which gives us the previously mentioned $M^{L_t}$ path metrics. By using the $L^2$-orthogonality from the
previous section, those correlations are canceled out, which leads to Eq. (19).

Here it can be seen that for nonlinear modulations, $L^2$-orthogonality is sufficient to decorrelate the signals of
the transmit antennas. The pointwise orthogonality of the orthogonal codes used in linear modulations is also a
sufficient condition to simplify Eq. (18). But this would impose stronger restrictions upon the STC.

Eq. (19) implies that $L_t$ conventional CPM signals have to be decoded. Hence, the complexity grows only linearly
with the number of transmit antennas, i.e. due to the decorrelation of the transmitted signals, each data stream from
one transmit antenna can be decoded separately. Alternatively, the ML metric can be transformed into a correlation
between the received signal $r(t,\mathbf{d})$ and a hypothetical version of this signal. We get an equivalent correlation-based
metric by

$D_B(\mathbf{d},\tilde{\mathbf{d}}\,|\,\theta(k)) = \sum_{m=1}^{L_t} \int_0^{L_tT} \mathrm{Re}\{ r(t,\mathbf{d})\, h_m^*\, c_m^*(t)\, s^*(t,\tilde{\mathbf{d}}) \}\,dt$. (20)
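One way to see why a distance metric and the correlation metric of Eq. (20) rank the hypotheses identically is sketched below. The sketch assumes that the ML metric is the squared Euclidean distance between $r(t,\mathbf{d})$ and the hypothesized noiseless superposition, that $|s(t,\tilde{\mathbf{d}})|^2 = A^2$ is constant, and that the correction functions are orthogonal over the block. The symbol $A$ and the shorthand $\tilde{s} = s(t,\tilde{\mathbf{d}})$ are introduced here only for illustration.

```latex
\begin{align*}
\int_0^{L_tT} \Big| r - \tilde{s} \sum_{m=1}^{L_t} h_m c_m \Big|^2 dt
 &= \int_0^{L_tT} |r|^2\,dt
  - 2 \sum_{m=1}^{L_t} \int_0^{L_tT} \mathrm{Re}\{ r\, h_m^*\, c_m^*\, \tilde{s}^* \}\,dt
  + A^2 \int_0^{L_tT} \Big| \sum_{m=1}^{L_t} h_m c_m \Big|^2 dt, \\
\int_0^{L_tT} \Big| \sum_{m=1}^{L_t} h_m c_m \Big|^2 dt
 &= \sum_{m=1}^{L_t} \sum_{m'=1}^{L_t} h_m h_{m'}^* \int_0^{L_tT} c_m\, c_{m'}^*\,dt
  = L_tT \sum_{m=1}^{L_t} |h_m|^2 .
\end{align*}
```

Under these assumptions the first and the last term do not depend on the hypothesis $\tilde{\mathbf{d}}$, so minimizing the distance is equivalent to maximizing the correlation sum of Eq. (20).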

By splitting the correction filter $c_m(t)$ from the conventional CPM signal $s(t,\tilde{\mathbf{d}})$, we define a pseudo received
signal as

$x(t,\mathbf{d}) = r(t,\mathbf{d}) \sum_{m=1}^{L_t} h_m^*\, c_m^*(t)$. (21)

This signal corresponds to a single preprocessed CPM signal which is decoded by

$D_B(\mathbf{d},\tilde{\mathbf{d}}\,|\,\theta(k)) = \int_0^{L_tT} \mathrm{Re}\{ x(t,\mathbf{d})\, s^*(t,\tilde{\mathbf{d}}) \}\,dt$. (22)

Hence, only one CPM signal has to be decoded and we obtain a single inner-block trellis, which is shown at the
bottom of Figure 2. The metric to compute the symbol-wise distances at time slot $r$ is given by

$D(\mathbf{d},\tilde{\mathbf{d}}\,|\,\theta'(k)) = \int_{(r-1)T}^{rT} \mathrm{Re}\{ x(t,\mathbf{d})\, s^*(t,\tilde{\mathbf{d}}) \}\,dt$. (23)

Fig. 3. BER for PC CPM with $M = 4$, 2REC, $h = 0.5$ and $h = 0.8$

This additional complexity reduction is accomplished due to the parallel structure of the proposed code. Finally,
since $\alpha = 1$, the phase drift per block is always an integer ($0, \ldots, L_t - 1$) and therefore $c_m(0) = c_m(L_tT)$.
Thus, the accumulated phase memory $\theta(l)$ at the beginning and the end of each block is defined over the same
set of rational numbers, i.e. $\Omega_\theta = \{0, 1/p, \ldots, (p-1)/p\}$. The states of the inner trellis at the end of one block,
$\theta(k)$, and at the beginning of the next, $\theta'(k)$, are consequently equal, and the inner- and inter-block trellises can
be merged into one trellis. The new block-independent trellis is equivalent to the trellis of one conventional CPM
signal. This is consistent with the model we use for the modulation, where only one CPM signal is modulated and the
signals for the different transmit antennas are created by the phase correction functions. One can look at the phase
correction filters (phasers) of the transmitter, the physical channel and the dephasers of the receiver as a single-input
single-output pseudo channel. This pseudo channel benefits from the full diversity introduced by the correction filters,
whereas at the transmitter and receiver only a conventional coder and decoder are necessary.
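The pseudo-channel view can be made concrete with a small noise-free sketch. It applies the dephasing of Eq. (21) to a received signal built from one common unit-modulus waveform and checks that the resulting gain is real and, averaged over a block, equal to the sum of the channel energies $\sum_m |h_m|^2$. The placeholder waveform, the random seed, the sample count and the function names are assumptions for illustration; $\alpha = 1$ and $\beta_m = 0$ are used throughout.

```python
import cmath
import math
import random

def combined_channel(h, t, T=1.0, alpha=1.0):
    """sum_m h_m c_m(t) with c_m(t) = exp(j*2*pi*(m-1)*alpha*t/(Lt*T)), beta_m = 0."""
    Lt = len(h)
    return sum(h[m] * cmath.exp(2j * math.pi * m * alpha * t / (Lt * T))
               for m in range(Lt))

def dephase(r, h, t, T=1.0, alpha=1.0):
    """Pseudo received sample x(t) = r(t) * sum_m conj(h_m) * conj(c_m(t))."""
    return r * combined_channel(h, t, T, alpha).conjugate()

rng = random.Random(0)  # fixed seed so the example is reproducible
Lt, T, n = 3, 1.0, 600
h = [complex(rng.gauss(0, 1), rng.gauss(0, 1)) / math.sqrt(2) for _ in range(Lt)]
times = [k * Lt * T / n for k in range(n)]

# Placeholder unit-modulus waveform standing in for the common CPM signal.
s = [cmath.exp(2j * math.pi * 0.37 * t) for t in times]
r = [s_k * combined_channel(h, t) for s_k, t in zip(s, times)]  # noise-free reception
x = [dephase(r_k, h, t) for r_k, t in zip(r, times)]

products = [x_k * s_k.conjugate() for x_k, s_k in zip(x, s)]
mean_gain = sum(p.real for p in products) / n
total_channel_energy = sum(abs(h_m) ** 2 for h_m in h)
```

The instantaneous gain $|\sum_m h_m c_m(t)|^2$ fluctuates within a block, but its block average collects the energy of all $L_t$ channel coefficients, which is consistent with the diversity benefit described above.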

Finally, it should be noted that the complexity of the proposed receiver can be further decreased. Namely,
methods proposed in the literature [21] can additionally be applied to the CPM decoder.

IV. Simulation Results 

In this section, we evaluate the proposed transceiver implementation and the performance of the code by means of
simulations. For all our simulations, we use a linear phase pulse with a length of $2T$ (2REC) given by $q(t) = t/(4T)$
for $0 \le t \le 2T$, $q(t) = 0$ for $t < 0$ and $q(t) = 1/2$ for $t > 2T$. An alphabet of size $M = 4$ with $\Omega_d = \{-3, -1, 1, 3\}$
is used. Further, we assume blockwise transmission with block length $L_b = 130$. The channel coefficients $h_m$ have
Rayleigh distributed amplitude and uniformly distributed phase. They are assumed to be constant during one block
of length $L_bT$ and the receiver has perfect knowledge of those coefficients.

As stated earlier, the complexity of the most costly part of the decoder, the MLSE, is independent of the number
of transmit antennas. In our case, the trellis always has $pM = 16$ states with $M = 4$ paths originating from each
state. That means that we have to evaluate only 64 path weights per symbol and $64L_t$ per block. This is valid not
only for one but also for three transmit antennas. In contrast, for $L_t = 3$, a non-simplified receiver would have
had to evaluate $pM \cdot M^{L_t} = 1024$ paths per ST-block. For the proposed scheme only the size of the correction filter
bank grows with the number of transmit antennas. Hence, the decoding effort grows only linearly with the number
of transmitting antennas. Moreover, this filter bank needs to be evaluated only once per symbol.
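The path counts quoted in this section can be reproduced with a few lines of arithmetic. The sketch assumes the textbook rule for the number of CPM phase states with $h = m_0/p$ in lowest terms ($p$ states if $m_0$ is even, $2p$ if $m_0$ is odd), which is the state count that matches the figures reported here; the function names are hypothetical.

```python
from fractions import Fraction

def phase_states(h):
    """Phase states for h = m0/p in lowest terms: p if m0 is even, 2p if m0 is odd."""
    frac = Fraction(h)
    return frac.denominator if frac.numerator % 2 == 0 else 2 * frac.denominator

def trellis_states(h, M, gamma):
    """Phase states times the M**(gamma-1) correlative states of partial-response CPM."""
    return phase_states(h) * M ** (gamma - 1)

def path_weights_per_symbol(h, M, gamma):
    """Branches evaluated per symbol by the single merged trellis."""
    return trellis_states(h, M, gamma) * M

def path_weights_joint_block(h, M, gamma, Lt):
    """Branches per ST-block when all Lt symbols of a block are detected jointly."""
    return trellis_states(h, M, gamma) * M ** Lt

for h in ("1/2", "4/5"):
    print(h, trellis_states(h, 4, 2), path_weights_per_symbol(h, 4, 2))
print(path_weights_joint_block("1/2", 4, 2, 3))
```

The per-symbol count of the merged trellis does not depend on $L_t$, whereas the joint count grows as $M^{L_t}$.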



Figure 3 shows our simulation results for two different modulation indices: $h = 0.5$ and $h = 0.8$. A larger
modulation index increases the distance between two symbols and thereby improves the BER. The drawback of
this improvement is an increased bandwidth. As expected, the simulations in Figure 3 show that the BER of the
proposed STC CPM schemes also benefits from a larger modulation index. Further, the diversity gain becomes
clearly visible. The slope of the BER curves increases with a growing number of transmit antennas.

For the second group of simulations ($h = 0.8$) the decoding complexity increases slightly due to the modified
modulation index. The trellis now has $pM = 20$ states and we have to calculate 80 path weights per symbol. The
complexity of the correction filter bank remains unchanged.

V. Conclusion 

In this paper, we have presented a novel representation for $L^2$-orthogonal Parallel Coded CPM. This representation
decouples the data-dependent CPM modulator from the antenna-dependent correction filter bank and enables the
generalization of the $L^2$-orthogonal Parallel Codes to an arbitrary number of transmit antennas. It is also shown
that these generalized codes achieve full diversity.

The main advantage of this representation arises at the receiver level. The costly maximum likelihood sequence 
estimation, necessary for decoding the CPM [18], is now implemented only once, independently of the number of 
transmit antennas. The full diversity of the system comes from the correction filter bank which is applied only once 
per symbol. Hence, a simplified implementation and a decoding effort that grows only linearly with the number of
transmit antennas are obtained in exchange for a slightly increased bandwidth due to the correction filter.